N=1:

rank | frequency | n-gram |
---|---|---|
1 | 5087 | -ा |
2 | 4848 | -ी |
3 | 3631 | -र |
4 | 3328 | -ं |
5 | 2524 | -न |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 2030 | -ों |
2 | 786 | -री |
3 | 676 | -या |
4 | 674 | -ार |
5 | 665 | -ता |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 526 | -यों |
2 | 373 | -िया |
3 | 371 | -िंग |
4 | 305 | -रों |
5 | 286 | -ted |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 425 | -ियों |
2 | 240 | -ated |
3 | 189 | -ियां |
4 | 90 | -ारों |
5 | 87 | -सिंह |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 240 | -dated |
2 | 76 | -रियों |
3 | 58 | -तियों |
4 | 51 | -ेंद्र |
5 | 49 | -ेंगे। |

The tables show the most frequent letter N-grams at the ends of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is suffix detection rather than prefix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* FROM (SELECT @pos:=0) r, (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3)) FROM words WHERE w_id>100 GROUP BY RIGHT(word,3) ORDER BY cnt DESC) xx LIMIT 5;
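
The same counting can be sketched outside the database for any N. The following Python snippet is only an illustration, not the code used to produce the tables: the function name `top_word_endings` and the toy word list are hypothetical, and it assumes the input is a list of distinct word types, as the rows of the `words` table presumably are. Like `RIGHT(word, N)`, the slice `word[-n:]` returns the whole word when it is shorter than N characters.

```python
from collections import Counter


def top_word_endings(words, n, limit=5):
    """Return the `limit` most frequent word-final letter n-grams.

    Hypothetical helper: each entry of `words` is counted once, and the
    result is a list of (ending, count) pairs, most frequent first.
    """
    counts = Counter()
    for word in words:
        # word[-n:] keeps the last n characters; for words shorter than n
        # it yields the whole word, mirroring RIGHT(word, n) in the query.
        counts["-" + word[-n:]] += 1
    return counts.most_common(limit)


if __name__ == "__main__":
    # Toy data (assumption), not taken from the corpus.
    sample = ["dated", "updated", "related", "stated", "word", "sword"]
    for n in range(1, 6):
        print(n, top_word_endings(sample, n, limit=2))
```

Python slices strings by code point, so Devanagari vowel signs count as separate letters here, which is consistent with endings such as -ा in the tables.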